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ABSTRACT 

■ - - . We consider signals and operators in finite dimension which have sparse time-frequency represen- 
tations. As main result we show that an ^-sparse Gabor representation in C n with respect to a 
random unimodular window can be recovered by Basis Pursuit with high probability provided that 
S < Cn/ login). Our results are applicable to the channel estimation problem in wireless com- 
munications and they establish the usefulness of a class of measurement matrices for compressive 
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1. INTRODUCTION 



Efficient algorithms aiming at the recovery of signals and operators from a restricted number of 
measurements must be based on some a-priori information about the object under investigation. In 
a large body of recent work, the signal or operator at hand is assumed to have a sparse representation 
in a given dictionary. A typical example in this realm is the recovery of vectors that are sparse in the 
Euclidean basis, that is, of vectors which have a limited number of nonzero components at unknown 
locations. Such a vector is to be determined efficiently by a small number of linear measurements 
■ which are given by inner products with appropriately chosen analysis vectors. 

The difficulty in this body of work lies in the fact that sparsity conditions as those mentioned 
^ ■ above define nonlinear subspaces of linear signal or operator spaces. To circumvent a combinato- 
rial and therefore unfeasible exhaustive search, efficient alternatives such as £i-minimization (Basis 

* * i 

Pursuit) and greedy algorithms such as Matching Pursuits have been proposed in the sparse rep- 
resentations and compressed sensing literature, see, for example, [8j [T21 El El 121 1221 123 ESI El]. In 
compressed sensing one commonly uses linear random measurements for the recovery of a sparse 
signal with high probability. So far, mainly random Gaussian, Bernoulli and partial Fourier mea- 
surements have been considered successfully H21 El ES, EQ]- A typical result states that a signal 
of length N with at most S non-zero entries can be recovered from n randomly selected samples of 
its Fourier transform with high probability provided S < Cn/log(N) [5l [27]. 
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In this paper, we consider sparse representations in terms of time-frequency shift dictionaries, 
and investigate recovery conditions similar to the ones for Gaussian, Bernoulli and Fourier measure- 
ments. Here, T k denotes the cyclic shift respectively translation operator and Mg the modulation 
operator respectively frequency shift operator on C n , defined by 

(T k h) q = h k+q modn and (M e h) q = e 2 ^ n h q . (1.1) 

Then 7r(A) = M{T k , A = (k,£), is a time-frequency shift and the system {vr(A) : A G Z n xZ n }, 
Z n = {0, 1, ... ,n — 1}, of all time-frequency shifts forms a basis of C nxn [23l [21]. For a non-zero 
vector g, the so-called window, the set 

{7i(X)g: AGZ„xZ n } (1.2) 

is called a Gabor system [18] and the matrix G C nxn2 whose columns are the members n(X)g, 
A G Z n xZ n of a Gabor system is referred to as Gabor synthesis matrix. The Gabor system given 
in (11.21) is a tight frame in C n whenever g ^ (23J, [9] . 

A vector x is called ^-sparse if it has at most S non-zero entries; formally ||x||o := |suppx| = 
#{A : x x ± 0} < S. 

Our analysis of sparsity in conjunction with time-frequency shift dictionaries addresses the 
following, clearly equivalent objectives. 

Objective 1. Determine the coefficient sequence of a vector 

y= X ^( X )9 (1-3) 

Aez n xz n 

that is known to have a sparse representation in the Gabor system {ir(X)g : A G Z n xZ n } with 
window g ^ 0. Clearly, the representation (II .3p is redundant; given y the coefficient vector x is not 
unique and it is a non-trivial problem of computing efficiently the sparsest representation of y. 

If g is well localized in time and frequency, then the sparse coefficient vector x can be seen to 
describe the time-frequency content of any signal y = ^/ g x [18]. Note that the windows (II. 7p . (11.81) 
considered in this paper are neither well localized in time nor in frequency. 

Objective 2. Establish the applicability of \l/ g as measurement matrix for compressed sensing, 
that is, consider the rows of \l/ 5 as measurement vectors, in the classical strategy of efficiently 
determining a signal x which is sparse in the Euclidean basis. In short, the aim is again to recover 
x from y = \l/ g x whenever ||x||o can be assumed small. 

The window vector g used to achieve our main results Theorems 12.11 and 12 . 31 is chosen at random 
( EEHD , that is, ty g G C nxn depends on n independent random variables as compared to nxN 
independent random variables in the case of Gaussian or Bernoulli n x N measurement matrices 
[3 12]. Note that our results apply also to n x iV measurement matrices, iV < n 2 , that are obtained 
by removing n 2 — N columns from \l/ 9 . 

Further, the structure of \l/ 9 allows for fast Fourier transform based matrix vector multiplication 
algorithms [32] (in contrast to unstructured Gaussian or Bernoulli random matrices). This leads to 
efficient implementations of ^-minimization methods [1] . 
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Objective 3. Identify from a single input output pair (g, Tg) the coefficient vector x of an operator 

r= ^(A), ( L4 ) 

where T is assumed to have a sparse representation in the system of time frequency shift matrices 
{vr(A) : A G Z n xZ n }. 

In short, the task at hand is to identify T £ C nxra , or equivalently x, from its action y = Tg on 
a single vector g. Writing 

y = T 9= X ^( X )9 

Aez n xz n 

with unknown but sparse x, we observe the equivalence of this objective with Objectives 1 and 2. 

This objective falls in the realm of what is known as channel operator estimation / identification 
in communications engineering, and, indeed, (11.41) is a common model of wireless channels [31 d3 
[TUl [25] and sonar [3TI 124"] where physical considerations often suggest that x is rather sparse. First 
results were obtained in [26], on which we will improve here. Further, using multiple input output 
pairs for the efficient recovery of channel operators is discussed in [26]. Sparsity constraints in the 
dictionary of time-frequency shifts have also been considered for radar applications |2Uj . 



In the following, we will phrase our results in terms of Objective 1, namely, we assume that y is 
given and has an unknown S-sparse representation (11.31) in a given Gabor system (11. 2p with S < n. 

A natural strategy to recover the corresponding coefficient vector in this setup consists in seeking 
the vector x with minimal support consistent with y\ in other words solving the ^-minimization 
problem 

min ||x||o subject to ^/ g x = y. (1.5) 

X 

Unfortunately, this problem is NP hard in general [UJ, and hence, is not feasible in practice. In 
order to avoid this computational bottleneck, several alternative reconstruction methods have been 
suggested as mentioned above. We will concentrate here on Basis Pursuit, which seeks the solution 
of the convex problem 

min ||x||i subject to ^/ g x = y, (1.6) 

X 

where = X^Aez 2 I^aI is the ^-norm of x. This problem can be solved with efficient convex 
optimization techniques [H [H [13]. Of course, the hope is that the solution of (11.61) coincides with 
the solution of the ^-minimization problem (II. 5p . It is the goal of this paper to make this rigorous. 

So far we did not specify the window g in (11.21) . In [26J we proposed to work with the Alltop 
window g A [TJ [33] with entries 

< = ^e 2 ^, g = 0,...,n-l, (1.7) 
and with the randomly generated window g R with entries 



0f = -L:„ g=0,...,n-l, (1.8) 
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where the e q are independent and uniformly distributed on the torus {z G C, \ z\ = 1}; in other 
words, g R is a normalized Steinhaus sequence. The Alltop window will only be used for prime 
n > 5. Although both windows seem to be a bit unfamiliar in terms of time- frequency analysis due 
to their lack of time-frequency concentration (they are actually completely unlocalized in both time 
and frequency), they may perfectly be applied to the problem of identifying a sparse operator T of 
the form (jl.4p in Objective 3 [26J. 

In |26j . the following theorem concerning the recovery of sparse time-frequency representations 
in terms of g A and g R was shown. 

Theorem 1.1. 



(a) Let n > 5 be prime and g = g A be the Alltop window defined in fll. 7| ). If S < v/ ^ +1 then Basis 
Pursuit given in (11.51) recovers all S-sparse x from y = ^ g x. 

(b) Let n be even and choose g = g R to be the random unimodular window in U.8\) . Let t > 
and suppose 

1 / n 1 

S ~ 4y 21ogn + log4 + t + 2' ^ 

Then with probability at least 1 — er l Basis Pursuit (I1.5P recovers all S-sparse x from y = ^/ g . 



Theorem 1 1.1 1 is based on standard recovery results for Basis Pursuit which rely on the coherence 
of ty g [341 IT4"] . The coherence for g = g A was given in [55] . and the one for g = g R was estimated in 
[26] , see (14.51) . Although Theorem 11.11 shows that recovery guarantees can be given, the conditions 
on the maximal sparsity S are quite restrictive; S has to be as small as of the order of \J~n or even 
y/n/ log(n). 

Passing from a worst case analysis to an average case analysis in the sense that the support set 
of x and the signs of its non-zero coefficients are chosen at random, it is possible to apply recent 
results of Tropp [36J to show that recovery can be ensured with high probability provided 

n 

S<C——— (1.10) 

for some constant c where u = 1 in the case of g A and u = 2 in the case of g R . For a precise 
formulation of these results, see Theorem 2.5 in [26] . 

In this paper we will work with the randomly generated window g R and gradually improve 
conditions fll .Qp and (11.101) to S < Cn/ \og(n), while removing the randomness assumption on the 
coefficients x. It seems rather difficult to perform a similar task for the deterministic Alltop window 

9 A 

The paper is organized as follows. In Section [2] we state our two main results on recovery of 
sparse time-frequency representations, namely Theorems 12.11 and 12.31 Section [3] will deal with the 
estimation of the smallest and largest singular value of a submatrix of for g = g R , which plays a 
central role in the proofs of Theorems 12.11 and 12.31 In Section H] we prove Theorem 12.11 on recovery 
of sparse coefficients x with random phases; while Section [5] contains the proof of Theorem 12.31 on 
the recovery of deterministic sparse coefficients x. 



4 



Throughout the paper || • || p denotes the usual £ p -norm on sequences, while || • \ \p-+ q is the operator 

norm from £ p to £ q , and, for brevity || • || = || • 1 1 2 ^2 - The Frobenius norm of a matrix A is defined as 

\\A\\f = \J r Ti(A*A) ) where Tr is the trace. Furthermore, P(i?) denotes the probability of an event 
E and E means expectation. 



2. STATEMENT OF RESULTS 

Our results are concerned with the recovery by Basis Pursuit (11.61) of sparse time-frequency rep- 
resentations (11. 3p with the randomly generated window g = g R given in fll.Sp . We present a first 
result for deterministic support sets, that is, for every possible support set, but random phases of 
the coefficient vector x\ and a second result, Theorem 12.31 for deterministic x. 

Theorem 2.1. Let n be even, and let A C Z n x Z n be of cardinality |A| = S. Let x with 
supp (x) = A be such that on A the random phases (sgn(xA))AeA are independent and uniformly 
distributed on the torus {z G C, \z\ — 1}. Let a > 8. Choose the window g = g R as in U.8\) . that is, 
with random entries independently and uniformly distributed on the torus {z G C, \z\ = 1}. Then 
with probability at most 

2 <- 2 - s > «"» (-re) + CSew (-is*) + 

Basis Pursuit U.6\) fails to recovers x from y = ^ g x. Here, the constant C ~ 1.075. 
Remark 2.2. Note that the probability estimate above becomes effective once 

n > maxjlGeS* log(CS'), 645 log(n) log(2n 2 )}, 

or even simpler, if S < Cq log "^ n ^ for appropriately chosen Cq. 

The restriction to n even was made for the sake of simple exposition; a similar result holds also 
for n odd (compare also to Theorem 5.1 in [26]). 

Recovery is also possible for deterministic sparse coefficients. The corresponding proof is more 
involved, however. 

Theorem 2.3. Assume x is an arbitrary S -sparse coefficient vector. Choose the random unimodu- 
lar Gabor window g = g R defined in ( fi.gj) . that is, with random entries independently and uniformly 
distributed on the torus {z G C, \z\ = 1}. Assume that 

n 

S<C-—~- (2.1) 
log{n/e) 



for some constant C (see Remark 2.4)- Then with probability at least 1 — e Basis Pursuit U.6\) 
recovers x from y = ^x = ^> 9 x. 

Remark 2.4. From the proof of Theorem \2.3\ one can deduce information about the constant in 
Ii2.1\) . Indeed recovery is ensured provided 

n > max{C 1 S\og(n 2 /E),C 2 S(\og(S 4 /e) + C 3 )} 
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with C\ = 273.5, C<i = 64.1 and C3 = 8.35. Hence, the constant of Theorem \2.1\ is better than those 
in Theorem \2.3[ but this improvement comes at the cost of a worse exponent at the logarithm and 
of assuming random phases sgn(x\). 

Numerical experiments illustrating our recovery results were already given in [26]; clearly, they 
can only indicate the average case behaviour rather than the worst case behaviour covered in 
Theorem 12.31 These experiments suggest that most 5-sparse signals can be recovered provided 
S < 2Tog(n)- ^° Theorem 12.31 seems to indicate the right asymptotic order n/log(n), but the 
constants are likely not optimal. 

We note once more that both theorems can be interpreted as compressed sensing type results 

2 2 

on recovery of S-sparse vectors in C n from n measurements with \l/ g G ^yixn pi a yj n g the role of 
the (random) measurement matrix as described in Objective 2. Also, both results can be applied to 
identify matrices which have a sparse representation in the basis of time-frequency shift matrices 
as described in Objective 3. 

Furthermore, Theorems 12. II and 12.31 hold literally (including their proofs) when we pass from Z n 
to time-frequency analysis on an arbitrary finite Abelian group; in particular, on multi-dimensional 
versions Z^ with d > 1 where n would be replaced by n d in all of the statements. 



3. WELL CONDITIONED SUBMATRICES OF GABOR SYNTHESIS MATRICES 

It is crucial for sparse recovery that small column submatrices of measurement or synthesis matrices 
such as \1/ g are well-conditioned. Before proceeding to the proofs of our main Theorems 12.11 and 12.31 
we will deal with such an analysis in this section. 

Throughout the rest of the paper we let ^ = ^ g G C nxn with g = g R being the randomly 
generated unimodular window described in (11.81) . For A C Z n xZ n and A G (£ nxn we denote by A\ 
the matrix consisting only of those columns indexed by A G A. 

Theorem 3.1. Let e,5 G (0, 1) and |A| = S. Suppose that 

S < 5 " n (3 1) 

~ 4e(log(S/e) + c) 1 ' ; 

with c = log(e 2 /(4(e — 1))) ^ 0.0724. Then \\I A - *X*a|| <^ with probability at least 1 - e ; in other 
words the minimal and maximal eigenvalues of satisfy 1 — 5 < A m i n < A max < 1 + 5 with 

probability at least 1 — e. 

Remark 3.2. Assuming equality in condition ^3.1\) and solving for e we deduce 

P(l|/A " n*A|l > S) < j^Sexp = CSexp (3.2, 

with C w 1.075. 

In the following we will develop the proof of Theorem 13.11 
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3.1. Expectation of a Frobenius norms 

We set H = — I a- An important step towards Theorem 13. II is to estimate the expectation of 

the Frobenius norm of powers of H. Indeed having accomplished this task one may use Markov's 
inequality, the fact that the Frobenius norm majorizes the operator norm, and the fact that H is 
self-adjoint to obtain 

P(||/ A - ^a^aII >S) = P(||#|| > 5) = P(||#|| 2m > 5 2m ) < <T 2m E[||iJ|| 2m ] 

= <T 2m E[||# m || 2 ] < 5- 2m E[\\H m \\ 2 F ] = <T 2m E[Tr# 2m ]. (3.3) 

We will use the following concept to estimate E[Tr(if 2m )]. 

Definition 3.3. The associated Stirling number of the first kind, denoted by d2(m,s), is the 
number of permutations of m elements which involve exactly s disjoint cycles and where each cycle 
has at least 2 elements. 

The associated Stirling numbers satisfy the following recursion [291 P- 75] 

G?2(w+1, s) = m[d 2 (m, s) + d 2 (m— 1, s— 1)], l<s<m/2, (3.4) 

with boundary conditions 

d 2 (0,0) = l, d 2 (m,0) = 0, d 2 (m,s) = 0, m>l,s>m/2. (3.5) 

Equipped with this tool, the desired expectation of the Frobenius norm in (13.31) can be estimated 
as follows. 



Lemma 3.4. If S = |A| andm even then 

^c/ 2 (m, S ) (|) . (3.6) 

s=l 



m/2 



Proof. Note that for Xj G A, we have 
Tj _ / (K(\i)g,Tr(\ 2 )g)i if Ai ^ A 2 , 

- H Ai,A 2 



0, if Xt = A 2 , 

H liM = ^ H ^M H \ 2 m= E (7r(Ai)^,7r(A 2 )^) (7r(A 2 )^,7T(A 3 )^) , 

A2 A2^Ai,A3 

Km = Y, H l,Xs H >«M= E E (vr(A 1 )^,7r(A 2 )^)(7r(A 2 )^,7r(A3)^)(7r(A3)^,7r(A 4 )^) 

A3 A3^A4 A2^Ai,A3 

and, in general, 

H Z,x m+1 = E E -- E (^(Ai)^7r(A 2 )^(7r(A 2 )^7r(A 3 )^)---(7r(A m )^7r(A m+1 )^). 

A27^Ai,A3 A3^A4 \ m jL\ m+1 
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Consequently, 



TiH m = £ £ E MAi)5,7r(A 2 )g)(7r(A 2 )^,7r(A 3 )^)---(7r(A m )^7r(A 1 )g> 

Ai A 2 ^Ai,A 3 A 3 ^A 4 A m ^Ai 

£ M^i)^ 71 "^)^ (7r(A 2 )^,7r(A 3 )^) ■ ■ • (7r(A m )p,7r(Ai)p) . 



Ai,...,A m eA 
Ai^A 2 ^A 3 ^-^A m ^Ai 



Linearity of E implies that E[TrH m ] = £ Al#Aa#A , # .„ #WAl E *i,...,K. where 

4 A m = E [(tt(Ai)^, 7r(A 2 )#) (tt(A 2 )^, 7r(A 3 )<?> ■ ■ ■ (ir(X m )g, tt(Ai)p)] . 



(3.7) 



We denote A Q = (A; a , £ a ) with £ Q G Z n , a = 1, . . . , n. Applying once more linearity of E to the 
inner products in (13.71) we obtain 



■l 2 )/n e 27rij 2 (l2-e 3 )/n e 2mjm {Im-lxj/n 



31 32 

•E 



[gUi-ki)g{ji-k 2 ) g{32-k 2 )g{j2-h) • • • g{jm-i-k m -i)g{j m -i-k m ) g{j m -k m )g{j m -ki) .(3.8) 

Here and throughout the remainder of the paper, addition and subtraction of indices ji — k\ etc. is 
understood modulo n. 

The independence of the g(j) implies that the summands in ( 13. 8p factor into a product of 



expectations over powers of g(j) 7 s, namely, into factors of the form E g(j) Uj g(j)" 3 



, Uj, Vj G N. As 



Effl'O')] = and, by unimodularity of y/ng, E[g(j)g(j)] = ±, we have E g(j) Uj g(jp = if Uj ^ v s 

, n. We conclude that a summand appearing in (13.81) 



and E 



nUY'-gU) u > 



- n u i for j = 1, 

equals unless Uj = Vj for all j — 1, . . . , n. In other words, we have to consider only those cases 
where indices j a — k a and j a > — k a >+i in (13.81) coincide for some a, a' G {1, . . . , m}. 

Combining (13. 7p and (13.81) we obtain 



E[TiH r 



E 



J~J e 2irij a (la-la+i)/n . 



Ai,...,A m eA Jij2i...jm=l a=l 
Ai^A 2 ^A 3 ^-^Am^Aa 



j J g(ja-k a )g(j a -k a+ i) 



a=l 



.(3.9) 



So it remains to estimate how many of the |A|(|A| — l) m_1 (|A|— 2)-n m possible combinations of indices 
Ai, . . . , A m , ji, . . . ,j m contribute to (13.91) while taking into consideration that the exponential factors 
in (13.91) may lead to cancelations of nonzero summands as well. 

For the sake of simple illustration we start with an example. For given Ai, . . . , A m there could 
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exist m-tuples (ji, . . . ,j m ) with 



This scenario yields 



h-h = j2 -h (3.10) 

J2~h = ji-k 2 (3.11) 

33-h = 3A-h (3.12) 

ji-ki = j 5 -k 6 (3.13) 

jm-i-k m -i = jm-h (3-14) 

j m -k m = j 3 -k 4 . (3.15) 



1 t m _ _ l t m _ _ l 



g(ji-ki)g(j2-k 3 ) = -, g{h-k 2 )g{ji-k 2 ) = -, ... , g{j m -k m )g(i 3 -k 4 ) 

n n n 



and 
E 



g(ji-ki)g(ji-k 2 ) g{j2-k 2 )g{j2-h) . . . g{j m -i-k n ^.^g{j mr .x-k rt i) g(j m -k m )g(j m -ki) 
= n~ m . (3.16) 

Adding equations ( 13.101) and (13.111) above shows that this case, denoted in short by 

1 -> 2 -> 1, 3 -> 4 -> 5 -> . . . -> m-1 -> m -> 3, (3.17) 

is only possible if k\ = k 3 . Further, if this was the case, then we observe that there exists for 
each ji = 1, . . . , n and j 3 = 1, . . . , n exactly one choice of (m— 2)-tuple (j 2 , j 4 , j 5 , . . . ,j m ) satisfying 
equations ( 13.101) — ( 13. 15ft . thereby implying that ( 13.161) holds. But even these n 2 nonzero summands 
might cancel due to the phase factors present in ( 13. 8ft . respectively (13.91) . In fact, assuming that 
fci = k 3 holds and that the (m— 2)-tuple (j 2 , j±, . . . ,j m ) is chosen to satisfy (13.10p - (13.15j) . then 
( 13. 8p becomes 

j 1 j 3 . e 2Kij 3 (e3-e 4 )/n e 2wi(j 3 -k 3 +k 5 )(e4-e 5 )/n _ _ _ e 2m(j3-k 4 +k m )(e m ~£ 1 )/n 



i is (is -U -4 +• • • +im -I i ) In 



n J3 



where |cA l5 ...,A m | — 1- Recalling that Y^j=x e 2m ^^ n = whenever £ ^ 0, we see that the contributions 
in ( I3.16P cancel out unless l\ = £ 3 . In short, E\ lt _ } \ m only contributes if l\ = £ 3 in addition to 
k\ = k 3 . We conclude that \E Xl) ... t \ m \ = n~ m n 2 if Ai = A 3 and E\ 1 ^ m = otherwise. 

We will now generalize the consideration of the above example in order to derive the general 
estimate (13. 6ft . 
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Step 1. Fix Ai, . . . , A m . For E Xlt ...,A m in fl3.8j) to be nonzero, we must have that 

E g(ji-kx)g(ji-k 2 ) g{j2-k 2 )g{j2-h) ■ ■ ■ g(j m -k m )g(j m -ki) ^ (3.18) 

for some ji, . . . ,j m . We observed earlier, that this is only possible if each g(j) in (I3.18P can be 
paired with some g(j), so that E[g(j)g(j)] = K[\g(j)\ 2 ] = - becomes effective. For this to be the 
case, the indices 1, . . . , m must decompose into s cycles 



an —>• ai2 —>•... —> ain — > an, ... , a s i — > a s2 
r i + r 2 + • • • + r s = m , where, similarly to fl3.10p - fl3.15p . 



a s i, 



(3.19) 



Ja q i ~k aql 


Ja q 2 


ka q2 +l 


ja q2 ^a q 2 




-ka q3 +l 


— t 

) ^ a q(r q -l) 




-ka qrq +l 


Oiqr q _ ^org 


3a q \ ~ 


-k aq i+i 



(3.20) 



holds for q — 1, . . . , s. Further, (13.201) implies that whenever the s equations 



ka„i k a „ ~\~ ••• ~\~ k a , . -\- k D 



k aq2 +i + k aq3 +i + • • • + k a + i + k agl+ i, 



CXql 1 lv a q 2 o-l) ' ""IVq -u g 2Tl 1 -ujjti ""org 

g = 1, . . . , s, are satisfied, then any s-tuple (ju,jzi, • • • , jsi) £ defines a nonzero value for 
E g(ji-ki)g(ji-k 2 ) g{j2-k 2 )g{j 2 -h) ■ ■ ■ g(j m -k m )g(j m -ki) ^ 0. 



(3.21) 



(3.22) 



Still, as we saw earlier, the contributions of summands of the form (13.221) may cancel each other 
due to the phase factors in (13.71) . In fact, for j a , q = 1, . . . , s and p = 1, . . . , r q satisfying f)3.20p . 
we have 



E 



jl,—,jm=l 



II e 2 ^e~ 2m ^g( Ja -k a ) g(j a -k a+1 ) 



a=l 



s r q 



JctqpZ-OLqp Jotqp 



JH,j21,— ,jsl=l q=lp=l 
n s r q 

n - " 1 J^J J^J e 2n Va qp [iaqp-iaqp + l ) 

jll,j2l,—,jsX=l q=Xp=l 

n - " 1 e 2 ^* (■ 7CV <? 1 + ^«<JP ) {^qp-^qp + l) 

in j2iv,isi=i g=i p=i 



n e 



n- m c Al ,..., A 



hl,hl,—,3al=l 9=1 

I ^ e 2«i Q11 (e;L!^ 1p -^ 1p+ i) ) . . . ( V e^'-riCl^i 

\in=l / W=l 



-0 
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with |cA lv ..,A m | — 1 and k aql — 0, k aq2 — k aq2 — k aql , k aq3 — k aq3 -\-k aq2 — k aql , etc. Hence, E\ lt ... t \ m — 
if not simultaneously, we have 

f~a q i + £a q2 + • • • + @-a q ( ri _ 1 ) + ^a qri — ^a q2 +l + £a q3 +l + • • • + £a qrq +l £a ql +l 

for q — 1, . . . , s in which case Ex 1 ,...,x m = cx li ... t \ m n s ~ m . Consequently \Ex l7 ... t x m \ = n s ~ m whenever 
Ai, . . . , A m satisfies the s linear equations 

Aq: s1 + X aq2 + • • • + ^aq{r q -l) + ^a qrq = ^a q2 +l + ^a q3 +l + ■ ■ ■ + ^a qrq +l + ^a ql +l, (3.23) 

q = 1, . . . ,s. We conclude that of the |A|(|A|— l) m_1 (|A|— 2) • n m summands in (13.91) . only those 
need to be considered that correspond to a partition of the indices 1, . . . ,m of the j's into cyclic 
permutations and where the Ai, . . . , A m satisfy a corresponding system (13.231) of equations. This 
observation will be used to estimate E[Trif m ] in the following step. 

Step 2. We observe in addition to the above, that Ex u ...,x m with Aj = (ki,£i) contributes only if 
ki ^ k 2 , k 2 7^ k 3 , . . ., k m ^ k\. Indeed, ki = k i+ i = k implies £^ ^ £ i+ i by Aj ^ K+i- But since g 
and hence T^g is unimodular by assumption, the set {M^T^g, £ — 0, . . . , n — 1} forms an orthogonal 
system, and we have {M t .T k g , M ii+1 T k g) = 0, implying that £ou,...,A m — 0. The condition ki ^ k i+ i 
in turn implies that each cycle in (13.191) has at least two elements, as otherwise, (13.201) would lead 
to a contradiction. 

Now for each permutation with s cycles described by (13. 19ft we give an upper bound on the 
number of index tuples (Ai, . . . , A m ) satisfying the s equations (13. 23ft . To this end, we shall show 
that any s — 1 equations of the s equations f l3.23f) are linearly independent. 

First, note that each Xj appears on exactly one left hand side and on one right hand side of 
the equations (13.231) . Hence, a linear combination of these equations leading to the trivial equation 
= can be achieved involving only 0's and l's as coefficients, that is, by simply adding up some of 
the equations in (13.231) . But the fact that the right hand side of an equation consists exactly of the 
successor variables of the left hand side implies that a vanishing sum of equations in (13.231) must 
contain all variables on both sides. As all equations are non-trivial, this is achieved if and only if 
the sum is taken over all equations. Hence, the s equations (13.231) are linearly dependent, while any 
s — 1 equations are not. We conclude that the system (13.231) describes an m — (s — l)-dimensional 
subspace whose its intersection with A m has at most lAI" 1- ^ -1 ) elements. 

By definition of the associated Stirling numbers of the first kind there are d,2(m, s) permutations 
with s disjoint cycles of minimum length 2 of the index set {1, . . . , m}. Each of these permutations 
represent n s tuples (ji, . . . , j m ) and, at most |A| m ~( s_1 ) tuples (Ai, . . . , A m ) G A m satisfing f)3.23p . 
Each of these tuples of indices gives a contribution to (13.91) of absolute value at most n~ m . Finally, 
this yields 

m/2 

E[Tr# m ] < ^2d 2 {rn,s)\K\ m - {s ~ l) n s ~ m 

s=l 




and the proof of Lemma 13.41 is complete. 



□ 



3.2. Proof of Theorem \3A 



Given specific parameters n and S = |A| one may already obtain good estimates for the probability 
that || I \ — ^f\^\\\ < 5 by numerically minimizing the right hand side of (13. 6p over m e N and 
using ( 13. 31) . Theorem 13 .11 is proven by pursuing a similar strategy combined with an estimate of the 
numbers d 2 (m, s), compare also to [19] . 

We first claim that the associated Stirling numbers of the first kind satisfy the estimate 

d 2 (m+l,s) < (2m) m - s . (3.24) 

Indeed, (13.241) is true for m > 1 and s = or s > m/2 since then d 2 (m, s) = by (13.51) . It is also 
true for d 2 (2, 1) = 1. Now let m > 2 and suppose the claim is true for all d2(m',s) with m' < m 
and s > 0. Then 

d 2 (m + l,s) = m(d 2 (m, s) + d 2 (m - 1, s-1)) < m((2(m - l))^- 1 - 5 + (2(m - 2)) m - 2 -( s - 1 )) 
< 2m(2m) m - 1 ~ s = (2m) m " s . 

Now let 

m 

G 2m {z) := z- 2m Y,d2(2m,s)z s . (3.25) 
By LemmaE3l E[||if m || 2 ] < E[TrH 2m ] < SG 2m (n/S). Using the estimate (ET23jl we obtain 
G 2m {z) < z~ 2m ^(2(2m - i)fm-i-s z s <( — ) (Am)- 1 ^(z/4m) s 

3 = 1 \ Z J s = 1 

4m\ 2m v_i(2/4m) ,n+1 - (z/4m) 
— — (4m) 



z J z/Am — 1 

, , , [Am\ m 1 - Um/z) m 

4m - 1 — : ,' , \ - 3.26 
V -2 / 1 - (4m/ z) 



Now choose m = m z e N such that 4m z /z < a < 1, for instance 



m 2 



(3.27) 



Then 



G 2mz (z) < (4m,)- 1 - < — -. (3.28) 

1 — a 4(1 — a) 



We want to achieve P(||if || > 5) < e, which by (13.31) will be satisfied provided 

5~ 2m *S-^ <e 

4(1 - a) - 

for z = n/S. Assuming a < 5 2 the latter inequality is equivalent to 

S 



m z \og(5 2 /a) > log 
12 



4e(l - a) 



Plugging in z = n/S and m z given by (I3.27P we obtain 



an 

Is. 



log (5 2 /a) > log 



S 



4e(l - a) J ' 



Finally, choose a = <5 2 /e. Then the above inequality reduces to 

S 



5 2 n 


>log( 




_4eS_ 





s 4e(l - 5 2 /e) 

A straightforward calculation shows that the above equation is satisfied whenever 

4e { e S 
n > —b log 



5 2 & \4(l-5 2 /e) e 
Finally, the above inequality is implied by the assumption of Theorem 13.11 

Remark 3.5. Starting from the first inequality in A3. 28\) and proceeding analogously as in the 
previous proof one may deduce the slightly better but more complicated condition 

ensuring \\Ia — ^^aII < 8 with probability at least 1 — e. 

4. RECOVERY OF RANDOM SIGNALS 

Theorem 12.11 addresses the recovery of signals whose sparse coefficients in a Gabor expansion are 
chosen with random phases. Its proof is based on a recovery result due to Tropp [35] and Fuchs 
[T5] which is given in our framework as Lemma 14.11 below. 

Let ip\ = 7r(A)<7 be the column of \I> indexed by A. By R\x we denote the restriction of a vector 
x to the index set A. Furthermore, sgn(x) is the sign of a vector, that is, sgn(x)k = Xk/\ x k\ f° r the 
non-zero entries of x and sgn(x)k = else. 

Lemma 4.1. Suppose that y = ^x for some x with supp x = A. If 

\(yy> p , R A sgn(x))\ < 1 for all A, (4.1) 

then x is the unique solution of the Basis Pursuit problem A1.6]) . Here \I/ A denotes the Moore-Penrose 
pseudo-inverse of^\. 



4.1. Proof of Theorem EH 



We will use Lemma 14.11 in combination with Theorem 13.11 and an estimation of the coherence of 
\I/ given in [26J to prove Theorem 12.11 concerning recovery by Basis Pursuit of sparse signals with 
random phases. 

We aim at using the following Bernstein type inequality for a sequence of independent random 
variables having uniform distribution on the torus [36j Proposition 16], 



e -KU* 



PflV^ajl > u\\a\\ 2 ) < (4.2) 

1 K 
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for any k G (0, 1). By our assumption on the random phases e\ = sgn^), the scalar product on the 
left hand side of (14.1 1) is precisely of the above form with a = ^ A ip p . The 2-norm of this particular 
a can be estimated by 

w^Uph = iicn^-^pih^iKn^-iiin^iia. 



Now suppose that || I— ^^f\\\ < 5. The probability that this is the case is estimated by Theorem l3.ll 
Then || (^^a) -1 || < y— Furthermore, observe that 



\nn\2 = (^k^xmA < v 7 ^ 

V AeA / 



where p = max^A KV'A'jV'a)! denotes the coherence of Combining the above estimates yields 

W^MU < jz~s^^- ( 4 - 3 ) 

Now, the probability that recovery fails can be estimated by 
F(\(^\ijj p , R A sgn(x))\ > 1 for some p £ A) 

< P( |(^ A ^p,^ASgn(x))| > 1 for some p (£ A | p < -^=k\\H\\ < 8) 



+ P(^ > -=) +¥(\\H\\ > 5) 



a: 

Jn 

a. 

< ^P(|(M/ A Vp,^ASgn(x))| > 1 | p <-^=k\\H\\ < 5) 

+ F(p> -^=)+F(\\H\\>S). 
\/n 



Equation (14.31) implies that for u = ^JJif^ we have w||\I/ A ^p||2 < 1, so ( 14. 2[) gives 

F(\(^ p ,R A sgn(x))\>l\ p<^=k\\H\\<5) < (1 - k)" 1 exp (-^^^l^j . (4.4) 

In [261 Theorem 5.1] it was proven that 

P(/i > —=) < 2(1 - K!)- l n{n - 1) exp(-«/a 2 /2) (4.5) 



for any k' G (0, 1). Combining inequalities (14.41) . (14.51) and (13.21) . we obtain the following bound on 
the probability that recovery fails 



n 2 - S)(l - k)' 1 exp — k =^-~ +2(1-k')- x n(n - 1) exp(-/tV/2) + C^exp 



a 2 S J ' ' i y 11 ^ \ AeS 
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Choosing a = y / log(n CT ) with a > 8, k = k' = 1/2 and 5=1/2 the above expression equals 

£ 2 <" 2 - s > ex » ("s^iw) + ^"^ + ("iSs) • 

This completes the proof. 

5. RECOVERY OF DETERMINISTIC SIGNALS 

In this section we prove Theorem 12.31 As an auxiliary tool we first provide a general recovery 
lemma. 

5.1. A general recovery lemma 

The following lemma holds for any (random) matrix \F It is inspired by the analysis performed in 
[5] and [27]. 

Let A be a subset of the column index set of and A c its complement. Let E A = R* A be the 
adjoint of the restriction operator R\; clearly, E\ extends a vector outside A by 0. Further, we 
define 

H = ^* a ^a - I A and K = *tf A - E A . 
Observe that H = R A K. With this notation we have 

Lemma 5.1. Let x be supported on A with |A| = S. Let (3 > 0, k > 0, m G N and L t G N, t = 
1, . . . , m, be parameters such that 

a , = y r / Lt<l and _j^ < ^ s -3/2_ (51) 

^ r 1 — k 1 — t— n y ' 



i 1 — k 1 + a 



Then with probability at most 



K~ 2 



E [\\H m \\ 2 F ] +P~ 2m £ E E R A Y sgn(x)) p \^} (5.2) 
p eA c t=i 



Basis Pursuit fails to recover x from ^x. 

Proof. We reassemble the arguments from [2T1 E] for the reader's convenience. 
First, we address the recovery condition (14.11) . Let A be the support of x. Define 

P := ^^ A (^^ A )- 1 i? A sgn(x). 

Note that condition (14.11) in Lemma 14.11 is equivalent to 

||-Ra c -P||oo < l- 
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The vector P can also be written as 

P = (E A + K) (I A + H)~ l R A sgn(x), 

and, since R A cE A = we have 

R Ac p = R A cK(I A + HyiRASgnix). 

Let us look closer at the term (I A + if) -1 . By the Neumann series we can write (I A — (—H) m y l = 
I A + A rn with 

oo 

A m := (5-3) 

r=l 

Using the algebraic identity (1 - M) _1 = (1 - M m ) _1 (l + M H h M m_1 ) we obtain 

m— 1 



(/A + if)- 1 = (7 A + A m ) J] (-//)*. 



t=0 

Thus, on the complement of A, we may write 

f m—l 



R A .P = K{I A + A m )\^{-H) t] jR A sgn{x) 

= P« + P( 2 \ 

where 

P (1) := -Q m sgn(x) and P (2) := KA m R A (I + Q m ^) sgn(x), 

with 

m— 1 m 

Q m := -Y,K{-R A K) t R A = ^(-Oa)'. 



t=o t=i 



With this at hand, we can now proceed to estimate P(sup pgAc \P p \ > l). To this end let a, b > 
be numbers satisfying a + b = 1. Then 

P( sup \P p \ > l) < P ( {sup |P p (1) | > a} U {sup |P p (2) | > 6} J . (5.4) 

Clearly, 

P(|P P (1) | > a) < P ( ]T |((XP A )*sgn(x)) / ,| > a] =: F(Q p ), p e A c . (5.5) 



For P^ 2 -* we obtain 



SUp |Pp 2 ^| < ||P^||oo < ||^An|| (l + \\RaQ m—l 

sgn(x)|| ). (5.6) 

peA c 
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In order to analyze the term \\R^Q m -isgn(x) we observe that similarly as in (15. 5p 

P(|(Q m _isgn(x)) p | > a) < P ( jr \(K R A y sgn(x)) p \ > a) = P(ft p ), p E A c . 



Let us now focus on the operator norm appearing in (15. 6ft . It holds ||A||oo-»oo = sup^- Y^e 
Clearly, 

II K A II < II K\\ II 4 

J 1 Jim. no — >no no — >no W-^-rr 



Moreover, 



l^llno^no — |A| — S 



(5.7) 
(5.8) 



as K has S* columns and each entry is bounded by 1 in absolute value. 

Let us analyze A m using the Frobenius norm. Assume for the moment that 

\\H m \\ F < k < 1. 

Then it follows directly from the definition (15.31) of A m that 



(5.9) 



m\\F 



r=l 



< E ii" 

r=l 



< > K'' 



E 

r=l 



1 - K 



Moreover, since A m has |A| = 5* columns it follows from the Cauchy-Schwarz inequality that 

|A|^|(A m ) AiA ,| 2 < S\\A m f F . (5.10) 



|A»||oo-oo < SU P 



AeA 



So assuming (15.91) and ||Q m _iSgn(x)|| 00 < a, we can combine (15.71) . (15.81) and (15.101) to obtain 



\Pf ] \ < (l + a)S 3 / 2 



K 



peA c 1 — ft 



By assumption of the lemma 



K < 1 ~ Q C-3/2 



1 — k 1 + a 



1 + a 



g-3/2 



(5.11) 



and sup pgAc |Pp | < b under condition (15.91) as desired. Also it follows from (15.10 that k < 1 as 
5 > 1 without loss of generality (if A = then x = and ^-minimization will clearly recover x.) 

Using the union bound we obtain from (15.41) 

P(sup|P p |>l) < P| M{|pW|>a}U{||Q m „ 1 sgn(a;)|| 00 >a}U{||iJ m || F > K }) 
" eAc W= / 

< p( \Jn p U{\\H m \\ F >K}\ <^P(fi p )+P(||iJ m || F > K ). (5.12) 

\peA c / peA c 
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Markov's inequality now gives 

W(\\H m \\ F > k) = F(\\H m \\ 2 F > k 2 ) < K- 2 E[\\H m \\ 2 F }. (5.13) 
It remains to investigate P(Q p ). To this end let (3 t , t = 1, . . . , m, be positive numbers satisfying 

m 

and let Lf G N, i = 1, . . . , m. For p G A c , we have 

tm \ m 

J2\((KR A Ysgn(x)) p \>a) < £p(|((#i2 A )*sgn(z)),l > ft) 
t=i J t=i 

m m 

= ^P(|((^ A ) t sgn(x)) p | 2i ' > (3 2Lt ) < 5> [\((KR A Ysgn(x)) p \^] (3- 2L >, (5.14) 



t=i t=i 



where Markov's inequality was used to obtain the last inequality. Let us choose (3 t = j3 m / Lt , that 
is, f3- 2L * = This yields 

m 

P(0 P ) < /5" 2m ^E [|((ir^ A )*sgn(a;)) p | 2 ^] (5.15) 
t=i 

and the condition a = YlT=i fit reads 

a = J2fi m/Lt < L 



t=i 



This is precisely the first condition in (15. ip . Assembling (15.121) . (I5.13P and (I5.15P completes the 
proof. □ 

5.2. Estimate of an auxiliary expected value 

Lemma 15.11 suggests the investigation of the expected values appearing in (15.21) . As the ex- 
pectation E[||iy m |||i] was already estimated in Lemma |3.4[ we focus here on terms of the form 
E[\((KR A y S gn(x)) p \ 2L *]. 

Lemma 5.2. Let A c Z n x Z„ with |A| = S. Then for p A, 

(q\ 2tL tL 
- £d a (2iL, a )Q) . 
' s=l 

Proof. Note that (KR A y = K(R A Kf~ 1 R A = KH t ~ 1 R A . Denote a = R A sgn(x). Then for 
p ^ A we have 

((KR A Y)a) p = «P)9, K(\i)g) (A\i)g, n(\ 2 )g) ■ ■ ■ (n(\ t _i)g, ir(\ t )g) a(X t ) . 

Ai^A 2 ^A 3 ...^A t eA 



Furthermore, setting A " := X^ u := p A, u = 1, . . . , L, for notational brevity, we can write 



\((KR A fa) 



\1L 



\\,...,\\zA u=l 
A^,...,A^ l£ A 

Using linearity of expectation we obtain 



E\((KRaYct) p \ 2L 



r=l 



E S \{,-A 2 t LF \l,. 



,A? L 



(5.16) 



Ag i ,...,A^ 1 eA 



A 1 ,...,A i 



1 and 



where 5 A i A 2l does not depend on g. 

' L t 

II U« X r-^ <K U - l )9) {<K-i)9, <K U )9) 



F Ai,...,A^ 



E 



u=l r=l 



Let us write A" = (k?, If = /c" for r G {1, . . . , t}, w G {1, . . . , 2L} then necessarily ^ C 
due to the condition Ajf_i 7^ A" on the index set of the sum. Observe that this holds as well 
for r = 1 and r = t since Aq U_1 = A^ u = p ^ A. Due to the unimodularity of g we have then 
(7r(AJf_ 1 )5f, 7r(A")#) = and the corresponding F a i__ )A 2l does not contribute to the sum. Hence, in 
the following we may assume as in Section [3] that 7^ fc". 

As in (13. 9p we may write 



a;...a? 



E T 4;4,..,^0'i' 



i3t )Jk ;kl,...,k? L Uli ■ ■ ■ i3t ) 



(5.17) 



jl j2L = 1 



,-1 ,'2i_ 



with 



and 



Qlo;4,...,^ i 0lJ • • • > 



1 



2L « 



(5.18) 



tt=l r=l 



Jk ;k\,... ) k^ L {jli ■ ■ ■ ifi 1 ) 



E 



2L t 



\[X[9{3r - K-Mtt - K) 

_u=l r=l 

As discussed in Section [3j the independence of the g(j) implies that the expectation above factorizes 
into a product of expectations. However, we have to be careful again since some of the indices 
j" — and j", — kfi might equal the same number j. In this case one of the factors in the product 
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equals E[|g(j)| 2 ] = 1/n (or a higher power if more than two indices are equal). As in the proof of 
Lemma E31 we have to count such cases. Again, they necessitate that jl, ■ ■ ■ ,jf L must decompose 
into s sets 

{j ail ,ja 12 ,- ■ -,ja ln }, {j 

with r\ + r 2 + . . . + r s = 2tL, and for each q = 1, . . . , s we have 

ja q2 ~ k aq2 -i = j aq3 — k aq3 , 
ja qrq ~ k aqrq -i = ja ql — k aql . (5.19) 

Here, it is understood that = A;"_ 1 if a = (r,u). As done earlier in Lemma [3.41 we represent 
such a case by the s cycles 

an —>• ai 2 —>•... —> a ln -> a lb ... , a si -> a s2 -> • • • -> «sr s — > a sl . 

Now if 

X>^ = E^ ( 5 - 2 °) 
p=i p=i 

for all q = 1, . . . , s then any vector of indices (j an , ■ ■ ■ ,ja 3l ) £ {1, • • • , n} s gives 



Jk ;k\,...,k'* L {ili ■ ■ ■ )j't L ) 



n 



by setting the other indices according to (15.191) . Plugging this into (15.1 7p and (I5.18P we realize 
that these contributions are canceled out unless 

r q r q 

Evi = E^. ( 5 - 21 ) 
p=i p=i 

So we obtain n s non-zero contributions of absolute value n~ 2tL to F x i ) ... ) a 2£ 1 if an d only if 

r q r q 

^p- 1 = Yl Xa iv for all g = 1, . . . , s. 

p=i p=i 

Arguing similarly as in the proof of Lemma 13.41 we conclude that these s equations are linearly 
independent. However note that in contrast to the situation there we now have Aq U_1 = \ 2u = p ^ A. 
With similar arguments as in the end of the proof of Lemma 13.41 we finally obtain 

/Iain 2tL tL , v 

E\((KR A fa) p \ 2L < (^J J>(2*M (J^j . 

□ 
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5.3. Proof of Theorem |2J3| 

Applying Lemma I5TTI using the estimates of Lemmas 13.41 and 15.21 the definition of the function G2 m 
in (13.251) and |A C | < n 2 , we conclude that the probability of recovery failure is upper bounded by 

m 

k- 2 E [\\H m \\ 2 F ] +P~ 2m £ E E [\((KR A Ysgn(x)) p \ 2Li } 

peA c t=i 

m 

< K- 2 SG 2m (n/S) + n 2 f3~ 2m £ G 2tLt (n/S) (5.22) 

t=i 

provided that the conditions given in (15.11) hold. 

For specific S, n one may already use this estimate to compute an explicit probability bound by 
numerically minimizing over m and the remaining parameters. Following the analysis in [27] we 
provide an estimate, which is easier to interpret. 

We choose L t as m/t rounded to the nearest integer. It is then straightforward to deduce that 

tL t E {[2m/3] , [2m/3] +1,..., |4m/3j}, te{l,...,m}. 

Let z = n/S. Using (13.28}) we obtain 

^ a m ' a 2m/3 
> GotLtiz) < m max Go-m'iz) < m max — < m— - 

m'6{r2m/3l,...,L4m/3j} " m'e{ \2m/3] |4m/3j } 4( 1 - a) 4(1 - a) 

for any a < 1 with 4m'/ z < a for all m! G { [2m/3] , . . . , |_4m/3j }, which is the case for 

3az 



m = m. 
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(5.23) 



This yields 

^ 4(1 -a) 

Now choose 

a := /3V 3 / 2 . (5.25) 
Then the right hand side of (15.241) becomes n 2 m z which is less than or equal to e/2 if and 



only if 



m z - log ( 2 ,™^_ ^ ) > \og(n 2 /e). 



A numerical test shows that (3 = 0.47 is a valid choice and the corresponding a = Y1T=\ P m ^ Lt 
will always be less than 0.957. Assume for the moment that m z > M e N, M > 6. Since 
1 1 — > t^ 1 log( 2( - 1 ^^ ) is monotonically decreasing for t > 6 and a as in (15.251) . (3 = 0.47, we obtain 

, , rn z \ ( _ x ( m z \\ ( log(M(l - ot)^/2) \ 

m z - Iok — r = m z 1 - m, log — > ra z 1 



2(1 -«); z \ z & V 2 (i-a)y/~ V M 
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The elementary inequality [_y\ > for y > M yields 



m, 



3az 



3azM 3M - 

~16(M + 1) 16{M+lf 
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Altogether, the left hand side of (I5.24p is less than e/2 provided 



- = z>Q((3,M)- 1 \og(n 2 /e) (5.26) 



and m z > M, where 



Q(I3, M) := ™ (l - Mj£(l^!£^Vg) 

w ' ; i6(M + iy V M 

Taking M = 20 yields 

d : = Q(0.47,20) _1 « 273.5. 

Without loss of generality we may assume S > 1 (otherwise x = and there is nothing to prove). 
Then ( I5.26P requires at least n/log(n 2 ) > C X S > C\, and a numerical test reveals that necessarily 
n > 10000. The minimal choice z = C x log(10000 2 ) yields then m z = [3aCi log(10000 2 )/16j = 
21 > 20 = M, that is, our initial assumption m z > M is satisfied if 

n>C x S]og{n 2 le) (5.27) 

and, hence, this ensures n 2 (3~ 2m YlT=i ^2tL t { n /S) < e/2 as well. 

Now consider the other term SK~ 2 G2 m {n/ S) in the probability bound (I5.22p . We choose k such 
that there is equality in the second inequality of (15. ip . that is, 

(l_ Q)/( l + Q) g-3/2 1-a 3/2 

l + (l_ a )/(l + a )^3/2 - 2(1 + a) 

Together with (I3.28|) and the choice (15.231) (with z = n/S) we obtain 



-a~ ^ . / 2(1 + a) 



1 - a) / 4(1 - a) 
Requiring that the latter expression is less than e/2 is equivalent to 

) 2 

— a) 

As above assume for the moment that m z > M. Plugging in a from above yields 

M 3az 3M 



log(«->, > log(S 4 /e) + log (2 (1 _ (1 Q) t ( a i ) 



■ ~ M+ 1 16 16(M + 1) M 



It follows that 2 G 2mz {z) < e/2 if 
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As already remarked the choice (3 = 0.47 results in a < 0.957. Choosing M = 21 (m z > 21 will be 
ensured by (I5.27P anyway as shown above) gives 

z > C 2 (\og(S A /e) + C 3 ) (5.28) 

with C 2 « 64.1 and C 3 « 8.35. 

Since S 1 < n 2 , combining (I5.27P and (15.281) finally shows the existence of a constant C such that 

n > CS\og(n/e) 

ensures recovery with probability at least 1 — e. This proves Theorem 12.31 
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